OUTLIER DETECTION TECHNIQUE FOR HETEROGENEOUS DATA USING TRIMMED-MEAN ROBUST ESTIMATORS

نویسندگان

چکیده

Context. Fortunately, the most commonly used in parametric statistics assumptions such as normality, linearity, independence, are not always fulfilled real practice. The main reason for this is appearance of observations data samples that differ from bulk data, a result which sample becomes heterogeneous. application conditions generally accepted estimation procedures, example, mean, entails bias increasing and effectiveness decreasing estimates obtained. This, turn, raises problem finding possible solutions to processing sets include outliers, especially small samples. object study process detecting excluding anomalous objects heterogeneous sets.
 Objective. goal work develop procedure anomaly detection sets, rationale using number trimmed-mean robust estimators statistical measure location parameter distorted distribution models.
 Method. problems analysis (processing) containing sharply distinguished, suspicious considered. possibilities methods have been analyzed. A identification extraction outliers caused by measurement errors, hidden equipment defects, experimental conditions, etc. has proposed. proposed approach based on symmetric asymmetric truncation ranked set obtained initial statistics. For reasonable choice value coefficient, it use adaptive procedures. Observations fell into zone smallest lowest ordinal considered outliers.
 Results. allows, contrast traditional criteria identifying outlying observations, Smirnov (Grubbs) criterion, Dixon etc., split analyzed homogeneous component identify assuming their share total unknown.
 Conclusions. article proposes formation supposed zones set, built basis data. It complex procedures establish expected levels form region order dataset. final level dataset refined existing allow checking boundary (minimum maximum) outliers.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Outlier Detection Technique in Data Mining: A Univariate Approach

Outliers are the points which are different from or inconsistent with the rest of the data. They can be novel, new, abnormal, unusual or noisy information. Outliers are sometimes more interesting than the majority of the data. The main challenges of outlier detection with the increasing complexity, size and variety of datasets, are how to catch similar outliers as a group, and how to evaluate t...

متن کامل

Outlier detection for compositional data using robust methods

Outlier detection based on the Mahalanobis distance (MD) requires an appropriate transformation in case of compositional data. For the family of logratio transformations (additive, centered and isometric logratio transformation) it is shown that the MDs based on classical estimates are invariant to these transformations, and that the MDs based on affine equivariant estimators of location and co...

متن کامل

Outlier Detection Using K-Mean and Hybrid Distance Technique on Multi-Dimensional Data Set

Outlier Detection is a major issue in data mining. Outliers are the containments that divert from the other objects. Outlier detection is used to make the data knowledgeable, and easy to understand. There are many type of databases used now days, and many of them contains anomaly objects, detection or removal of these objects is known as outlier detection. In the proposed work outliers are dete...

متن کامل

Trimmed estimators for robust averaging of event-related potentials.

Averaging (in statistical terms, estimation of the location of data) is one of the most commonly used procedures in neuroscience and the basic procedure for obtaining event-related potentials (ERP). Only the arithmetic mean is routinely used in the current practice of ERP research, though its sensitivity to outliers is well-known. Weighted averaging is sometimes used as a more robust procedure,...

متن کامل

Robust statistics for outlier detection

When analyzing data, outlying observations cause problems because they may strongly influence the result. Robust statistics aims at detecting the outliers by searching for the model fitted by the majority of the data. We present an overview of several robust methods and outlier detection tools. We discuss robust procedures for univariate, low-dimensional, and high-dimensional data such as estim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Radio Electronics, Computer Science, Control

سال: 2022

ISSN: ['2313-688X', '1607-3274']

DOI: https://doi.org/10.15588/1607-3274-2022-3-5